Potential Information Maximization: Potentiality-Driven Information Maximization and Its Application to Tweets Classification and Interpretation

Authors

  • Ryozo Kitajima
  • Ryotaro Kamimura
  • Osamu Uchida
  • Fujio Toriumi
Abstract

This paper applies a new information-theoretic learning method, called “potential information maximization”, to the classification and interpretation of tweets. Social media sites such as Twitter are known to play a crucial role in transmitting important information during natural disasters. In particular, since the Great East Japan Earthquake in 2011, Twitter has been regarded as one of the most efficient and convenient communication tools. However, because tweets contain much redundant information, methods are needed to extract only the most important information from them. To cope with such complex and redundant data, a new neural information-theoretic learning method was developed. The method finds neurons with high potential and maximizes their information content, reducing redundancy and focusing on important information. It was applied to real tweet data collected during the earthquake and classified tweets as important or unimportant more accurately than conventional machine learning methods. In addition, examining the high-potential neurons made it possible to interpret how the tweets were classified.
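The abstract does not give the formulas, so the following is only a minimal sketch of the general idea under stated assumptions: a neuron's "potentiality" is taken here to be the variance of its connection weights, scaled by the largest variance and raised to a hypothetical sharpening parameter `r`, and "information" is taken to be the reduction from maximum entropy of the normalized potentialities. The function names, the parameter `r`, and both definitions are illustrative and may differ from the paper's own.

```python
import numpy as np

def potentiality(weights, r=1.0):
    """Assumed definition: per-neuron weight variance, scaled to (0, 1], raised to r.

    `weights` has one row per neuron; at least one row must have non-zero variance.
    """
    v = weights.var(axis=1)
    return (v / v.max()) ** r

def potential_information(phi):
    """Assumed definition: log(n) minus the entropy of the normalized potentialities."""
    p = phi / phi.sum()
    nz = p[p > 0]                      # skip zero entries, since 0 * log(0) is taken as 0
    entropy = -(nz * np.log(nz)).sum()
    return np.log(len(p)) - entropy

# Three hypothetical neurons with four weights each (made-up values).
w = np.array([[1.0, -1.0, 1.0, -1.0],
              [0.5, -0.5, 0.5, -0.5],
              [0.1, -0.1, 0.1, -0.1]])

info_r1 = potential_information(potentiality(w, r=1.0))
info_r2 = potential_information(potentiality(w, r=2.0))
print(info_r1, info_r2)
```

Under these assumptions, the information is zero when all neurons have equal potentiality and reaches log(n) when a single neuron carries all of it; raising `r` concentrates the distribution on the high-potential neurons.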

Similar resources

Throughput Maximization for Multi-Slot Data Transmission via Two-Hop DF SWIPT-Based UAV System

In this paper, an unmanned aerial vehicle (UAV) assisted cooperative communication system is studied, wherein a source transmits information to the destination through an energy harvesting decode-and-forward UAV. It is assumed that the UAV can move freely between the source-destination pair to set up line-of-sight communications with both nodes. Since the battery of the UAV may be limite...

A New GIS based Application of Sequential Technique to Prospect Karstic Groundwater using Remotely Sensed and Geoelectrical Methods in Karstified Tepal Area, Shahrood, Iran

In this research, the recognition of karstic water-bearing zones through the management of exploration data in the Kal-Qorno valley, situated in the Tepal area of Shahrood, is considered. For this purpose, the sequential exploration method was conducted using geological evidence and applying remote sensing and geoelectrical resistivity methods in two major phases including the regional and local s...

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

Unification of Information Maximization and Minimization

In the present paper, we propose a method to unify information maximization and minimization in hidden units. Information maximization and minimization are performed on two different levels: the collective level and the individual level. Thus, two kinds of information, collective and individual, are defined. By maximizing collective information and by minimizing individual information, simple ...

Robust Method for E-Maximization and Hierarchical Clustering of Image Classification

We developed a new semi-supervised EM-like algorithm that is given the set of objects present in each training image, but does not know which regions correspond to which objects. We have tested the algorithm on a dataset of 860 hand-labeled color images using only color and texture features, and the results show that our EM variant is able to break the symmetry in the initial solution. We compared...

Publication date: 2016